Cost-Optimal Execution of Trees of Boolean Operators with Shared Streams

نویسندگان

  • Henri Casanova
  • Lipyeow Lim
  • Yves Robert
  • Frédéric Vivien
  • Dounia Zaidouni
چکیده

The processing of queries expressed as trees of boolean operators applied to predi-cates on sensor data streams has several applications in mobile computing. Sensor data must beretrieved from the sensors to a query processing device, such as a smartphone, over one or morenetwork interfaces. Retrieving a data item incurs a cost, e.g., an energy expense that depletes thesmartphone’s battery. Since the query tree contains boolean operators, part of the tree can beshortcircuited depending on the retrieved sensor data. An interesting problem is to determine theorder in which predicates should be evaluated so as to minimize the expected query processingcost. This problem has been studied in previous work assuming that each data stream occurs ina single predicate. In this work we remove this assumption since it does not necessarily hold forreal-world queries. Our main results are an optimal algorithm for single-level trees and a proof ofNP-completeness for DNF trees. For DNF trees, however, we show that there is an optimal pred-icate evaluation order that corresponds to a depth-first traversal. This result provides inspirationfor a class of heuristics. We show that one of these heuristics largely outperforms other sensibleheuristics, including the one heuristic proposed in previous work for our general version of thequery processing problem.Key-words: query processing, boolean operators, energy, scheduling, greedy algorithm, datasharing ∗ University of Hawaii at Mānoa, HI, USA† École normale supérieure de Lyon, France‡ LIP laboratory – CNRS, ENS Lyon, INRIA, UCB Lyon 1§ INRIAhal-00869340,version2-18Oct2013 Exécution de coût optimal d’arbres d’opérateurs booléenspartageant des donnéesRésumé : Le traitement de requêtes, exprimées sous forme d’arbres d’opérateurs booléensappliqués à des prédicats sur des flux de données de senseurs, a de nombreuses applications dansle domaine du calcul mobile. Les données doivent être transférées des senseurs vers l’appareil detraitement des données, par exemple un smartphone. Transférer une donnée induit un coût, parexemple une consommation énergétique qui diminuera la charge de la batterie du smartphone.Comme l’arbre de requêtes contient des opérateurs booléens, des pans de l’arbre peuvent êtrecourt-circuités en fonction des données récupérées. Un problème intéressant est de déterminerl’ordre dans lequel les prédicats doivent être évalués afin de minimiser l’espérance du coût dutraitement de la requête. Ce problème a déjà été étudié sous l’hypothèse que chaque flux apparaîtdans un seul prédicat. Dans le présent travail nous éliminons cette hypothèse qui ne correspondpas forcément à la réalité. Nos principaux résultats sont un algorithme optimal pour les arbresavec un seul niveau, et une preuve de NP-complétude pour les arbres sous forme normale dis-jonctive. Pour les arbres sous forme normale disjonctive, cependant, nous montrons qu’il existeun ordre optimal d’évaluation des prédicats qui correspond à un parcours en profondeur d’abord.Ce résultat nous sert à concevoir toute une classe d’heuristiques. Nous montrons que l’une deces heuristiques a de bien meilleurs résultats que les autres heuristiques et, entre autres, que laseule heuristique précédemment proposée pour le cadre général.Mots-clés : traitement de requêtes, opérateurs booléens, énergie, ordonnancement, algorith-mique probabiliste, algorithme glouton, partage de données hal-00869340,version2-18Oct2013 Cost-Optimal Execution of Trees of Boolean Operators with Shared Streams3

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimizing Multiple Queries against XML Streams

Processing and querying streams, XML streams in particular, has recently become a widely recognized area of interest both in research and in industry. In contrast to traditional query evaluation for databases, where multiple queries against the same data can be evaluated sequentially, for a streamed environment only the simultaneous execution of multiple queries is feasible, as the sequential e...

متن کامل

OPTIMAL OPERATORS OF GENETIC ALGORITHM IN OPTIMIZING SEGMENTAL PRECAST CONCRETE BRIDGES SUPERSTRUCTURE

Bridges constitute an expensive segment of construction projects; the optimization of their designs will affect their high cost. Segmental precast concrete bridges are one of the most commonly serviced bridges built for mid and long spans. Genetic algorithm is one of the most widely applied meta-heuristic algorithms due to its ability in optimizing cost. Next to providing cost optimization of t...

متن کامل

Complexity Analysis of Tree Share Structure

We investigate the complexity of the tree share model of Dockins et al., which is used to reason about shared ownership of resources in concurrent programs. We obtain the precise Berman complexity for the first-order theory of the Boolean algebra of tree shares with constants, which is STA(∗, 2 O(1) , n)-complete. For the first-order theory over the “relativization” multiplication operator on t...

متن کامل

Parallel Generation of t-ary Trees

A parallel algorithm for generating t-ary tree sequences in reverse B-order is presented. The algorithm generates t-ary trees by 0-1 sequences, and each 0-1 sequences is generated in constant average time O(1). The algorithm is executed on a CREW SM SIMD model, and is adaptive and cost-optimal. Prior to the discussion of the parallel algorithm a new sequential generation with O(1) average time ...

متن کامل

Randomized Boolean Decision Trees: Several Remarks

Assume we want to show that (a) the cost of any randomized decision tree computing a given Boolean function is at least c. To this end it suuces to prove that (b) there is a probability distribution over the set of all assignments to variables of that function with respect to which the average cost of any deterministic decision tree computing that function is at least c. Yao in 11] showed that ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013